A Sketch Algorithm for Estimating Two-Way and Multi-Way Associations
Similar resources
A Sketch Algorithm for Estimating Two-Way and Multi-Way Associations
We should not have to look at the entire corpus (e.g., the Web) to know if two (or more) words are strongly associated or not. One can often obtain estimates of associations from a small sample. We develop a sketch-based algorithm that constructs a contingency table for a sample. One can estimate the contingency table for the entire population using straightforward scaling. However, one can do ...
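The sample-then-scale idea in the abstract above can be illustrated with a minimal Python sketch. It assumes details the abstract does not spell out: that a sketch is the k smallest document IDs of a word's postings list under a random permutation of IDs 1..D, and that the usable sample is every document whose permuted ID is at most the smaller of the two sketch maxima. The function names (`build_sketch`, `sample_table`, `scale_table`) are hypothetical and chosen for illustration only.

```python
def build_sketch(postings, perm_rank, k):
    """Keep the k smallest permuted IDs of a postings list.

    perm_rank maps each original doc ID to its rank (1..D) under a
    random permutation (an assumed construction, see lead-in).
    """
    return sorted(perm_rank[d] for d in postings)[:k]


def sample_table(sketch1, sketch2):
    """Build a 2x2 sample contingency table from two non-empty sketches.

    Returns ((a, b, c, d), d_s) where
      a = sampled docs containing both words,
      b = sampled docs containing word 1 only,
      c = sampled docs containing word 2 only,
      d = sampled docs containing neither,
    and d_s is the sample size (number of docs with permuted ID <= d_s).
    """
    d_s = min(sketch1[-1], sketch2[-1])
    s1 = {x for x in sketch1 if x <= d_s}
    s2 = {x for x in sketch2 if x <= d_s}
    a = len(s1 & s2)
    b = len(s1) - a
    c = len(s2) - a
    d = d_s - a - b - c
    return (a, b, c, d), d_s


def scale_table(table, d_s, total_docs):
    """Scale the sample table up to the full collection of total_docs."""
    factor = total_docs / d_s
    return tuple(cell * factor for cell in table)
```

In use, one would build `perm_rank` once for the whole collection, store one sketch per word, and call `sample_table` on any pair of sketches at query time. This shows only the straightforward scaling estimate; the abstract indicates that a better estimator exists, which is not reproduced here.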
Using Sketches to Estimate Two-way and Multi-way Associations
We should not have to look at the entire corpus (e.g., the Web) to know if two (or more) words are associated or not. A powerful sampling technique called Sketches was originally introduced to remove duplicate Web pages. We generalize sketches to estimate contingency tables and associations, using a maximum likelihood estimator to find the most likely contingency table given the sample, the mar...
Complexity of Estimating Multi-way Join
In a real life environment, spatial data is highly skewed. In general, there are two kinds of skews in spatial data. One is the placement skew and the other is the area skew. This paper introduces methods and the complexity of estimating the result sizes of the multi-way join for the area skewed spatial data. Especially, this paper describes the number and sort of the statistics which the optim...
A Third Way for Health Policy?
Economics has hit the mainstream in the last decade with popular books like Freakonomics and The Undercover Economist reaching the masses. These authors have used their toolkits far beyond the narrow scope of money and finance and answered questions pertaining to anything from social policy to demographics to crime. Their appeal has largely been their ability to explain that small underlying fo...
Multi-granulation fuzzy probabilistic rough sets and their corresponding three-way decisions over two universes
This article introduces a general framework of multi-granulation fuzzy probabilistic rough sets (MG-FPRSs) models in multi-granulation fuzzy probabilistic approximation space over two universes. Four types of MG-FPRSs are established, by the four different conditional probabilities of fuzzy event. For different constraints on parameters, we obtain four kinds of each type MG-FPRSs...
Journal
Journal title: Computational Linguistics
Year: 2007
ISSN: 0891-2017, 1530-9312
DOI: 10.1162/coli.2007.33.3.305